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Abstract — We consider wireless networks in which multiple 
paths are available between each source and destination. We 
allow each source to split traffic among all of its available paths, 
and ask the question: how do we attain the lowest possible 
number of transmissions to support a given traffic matrix? Traffic 
bound in opposite directions over two wireless hops can utilize 
the "reverse carpooling" advantage of network coding in order to 
decrease the number of transmissions used. We call such coded 
hops as "hyper-links". With the reverse carpooling technique 
longer paths might be cheaper than shorter ones. However, 
there is a prisoners dilemma type situation among sources - 
the network coding advantage is realized only if there is traffic 
in both directions of a shared path. We develop a two-level 
distributed control scheme that decouples user choices from each 
other by declaring a hyper-link capacity, allowing sources to split 
their traffic selfishly in a distributed fashion, and then changing 
the hyper-link capacity based on user actions. We show that 
such a controller is stable, and verify our analytical insights by 
simulation. 

I. Introduction 

There has recently been significant interest in multihop 
wireless networks, both as a means for basic Internet access, 
as well as for building specialized sensor networks. How- 
ever, limited wireless spectrum together with interference and 
fading pose significant challenges for network designers. The 
technique of network coding has the potential to improve the 
throughput and reliability of multihop wireless networks by 
taking advantage of the broadcast nature of wireless medium. 

For example, consider a wireless network coding scheme 
depicted in Figure [ija). In this example, two wireless nodes 
need to exchange packets xi and X2 through a relay node. 
On the one hand, a simple store -and- forward approach needs 
four transmissions. On the other hand, the network coding 
approach uses a store-code-and-forward approach in which 
the two packets from the clients are combined by means of 
an XOR operation at the relay and broadcast to both clients 
simultaneously. The clients can then decode this coded packet 
to obtain the packets they need. 

Design and analysis of efficient network coding schemes for 
wireless networks have recently attracted a significant interest 
from the research community. Katti et ah [3] presented a 
practical network coding architecture, referred to as COPE, 
that implements the above idea while also making use of 
overheard packets to aid in decoding. Experimental results 
shown in [3] indicate that the network coding technique can 
result in a significant improvement in the network throughput. 




(b) 



Fig. 1. (a) Wireless Network Coding (b) Reverse carpooling. 

Effros et ah [4] introduced the strategy of reverse carpool- 
ing that allows two information flows traveling in opposite 
directions to share a path. Figure [TJb) shows an example 
of two connections, from ni to 77.4 and from 77,4 to rii that 
share a common path (711,712,713,714). The wireless network 
coding approach results in a significant (up to 50%) reduction 
in the number of transmissions for two connections that use 
reverse carpooling. In particular, once the first connection is 
established, the second connection (of the same rate) can be 
established in the opposite direction with little additional cost. 

The key challenge in the design of network coding schemes 
is to maximize the number of coding opportunities, where a 
coding opportunity refers to an event in which at least one 
transmission can be saved by transmitting a combination of 
the packets. Insufficient number of coding opportunities may 
affect the performance of a network coding scheme and is 
one of the major barriers in realizing the coding advantage. 
Accordingly, the goal of this paper is to design, analyze, 
and validate network mechanisms and protocols that improve 
the performance of the network coding schemes through 
increasing the number of coding opportunities. 

Consider the scenario depicted in Figure [2] We have three 
sources of traffic, each of which is aware of two paths leading 
to its destination. For example. Source 3 (positioned at 715) can 
send packets to its destination (located at tii) at rates x\ and x^ 
on its two available paths. We consider the cost metric of the 
system to be the number of transmissions required to support 
a given traffic matrix. Under the current channel conditions, 
suppose that it is cheaper for Source 3 to send all its traffic on 
path (715, 77.7, Til). However, notice that there is an opportunity 
for reverse carpooling on a subpath (711,712,715). With this 
scheme, node 712 will broadcast coded packets to nodes ni and 
715. We refer to this broadcast link as a hyper-link. Although 
path (715, 712, ni) is more expensive for Source 3, if there is 
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Fig. 2. Increasing Coding Opportunities 



traffic from Source 1 (located at ni) that overlaps with it at 
n2, it might actually be the case that the lowest cost traffic 
split in the system would entail that Source 3 should use the 
hyper-link and send all its traffic using path (715, n2, ni), while 
Source 1 follows suit by using its path (ni, n2, ^5, 77.4). 

However, we immediately see that there is a prisoners' 
dilemma situation here - savings can only be obtained if 
there is sufficient bi-directional traffic on (ni ,77.2,^5). The first 
mover in this case is clearly at a disadvantage as it essentially 
creates the route that others can piggyback upon (in a reverse 
direction). Our challenge in this paper is to design and validate 
a distributed control scheme that provides an incentive to use 
reverse carpooling, eliminates the first-mover disadvantage, 
and hence allows the system to attain the state of lowest 
possible cost to support its traffic. 

A. Related Work 

Network coding research was initiated by a seminal paper 
by Ahlswede et ah [5] and since then attracted a significant in- 
terest from the research community. Many initial works on the 
network coding technique focused on establishing multicast 
connections between a fixed source and a set of terminal nodes. 
Li et al. [6] showed that the maximum rate of a multicast 
connection is equal to the minimum capacity of a cut that 
separates the source and any terminal. In a subsequent work, 
Koetter and Medard [7] developed an algebraic framework 
for network coding and investigated linear network codes for 
directed graphs with cycles. 

Network coding technique for wireless networks has been 
considered by Katabi et al. [3]. The proposed architecture, 
referred to as COPE, contains a special network coding 
layer between the IP and MAC layers. In [8] Chachulski 
et al. proposed an opportunistic routing protocol, referred to 
as MORE, that randomly mixes packets that belong to the 
same flow before forwarding them to the next hop. Sagduyu 
and Ephremides [9] focused on the applications of network 
coding in simple path topologies (referred to in [9] as tandem 
networks) and formulate a related cross-layer optimization 
problems. 

Closest to our problem are [10], [11]. Das et al. [10] 
propose a new framework called "context based routing" in 
multihop wireless networks that enables sources to choose 
routes that increase coding opportunities. They propose a 
heuristic algorithm that measures the imbalance between flows 



in opposite directions, and if this imbalance is greater than 
25%, provides a discount of 25% to the smaller flow. This has 
the effect of incentivizing equal bidirectional flows, resulting 
in multiple coding opportunities. Our objective is similar, but 
we develop iterated distributed decision making that trades off 
a potential increase in cost of longer paths, with the potential 
cost reduction due to enhanced coding opportunities. Marden 
et al. [11] consider a similar problem to ours, but unlike our 
focus on how to align user incentives, their focus is on the 
efficiency loss of the Nash equilibrium attained. Our objective 
is to design an incentive structure that would naturally result 
in the system converging to the lowest cost state. 

B. Main Results 

The key contribution of this research is a distributed two- 
level control scheme that would iteratively try to lead the 
sources to discover the appropriate splits for their traffic among 
multiple paths. On one level are the sources that selfishly 
choose to split their traffic across available multiple paths 
with costs and maximum capacities (set by the hyperlinks) 
on each. On the other level, the hyperlink nodes choose 
maximum capacities for paths that share that node as a 
result of the sources' decisions as well as opportunities for 
network coding. Note by splitting up the dynanucs in this 
fashion, our algorithm is a relaxation of the original cost- 
mininuzation problem. The iteration process continues until 
the entire network has reached local minimum which, since our 
formulation is convex, is also the socially optimal solution. We 
show that this process is asymptotically stable. We illustrate 
our approach as well as the quality of solution using numerical 
experiments. The experiments indicate that: the convergence 
is fast; the costs are reduced significantly upon using network 
coding; more expensive paths before network coding became 
cheaper and shortest paths were not necessarily optimal. Thus, 
the iterative algorithm that we develop from the relaxed 
formulation performs well in practice. 

II. System Overview 

Our objective is to design a distributed multi-path network 
coding system for multiple unicast flows traversing a shared 
wireless network. We assume that the schedule of wireless 
links given to us (for example, using CSMA), and hence 
abstract out the interference between links. We model the 
conmiunication network as a graph GiV^E), where V is the 
set of network nodes and E is the set of wireless links. For 
each link {ni^rij) G E, where {ni^rij) G V, there exists a 
wireless channel that allows node ni to transmit information 
to node rij. Each link (ni^rij) is associated with a cost aij. 
The value of aij captures the cost (number of transmissions) 
of transmitting information at unit rate from rii to nj . Due to 
a broadcast nature of the wireless channels, the node rii can 
transmit to two neighbors nj and Uk simultaneously at a cost 

We assume that the network supports flows {1,2,...,}, 
where each flow is associated with a source and destina- 
tion node. Each flow i is also associated with several paths 



{P/, P-^, . . . } that connect its source and destination nodes. 
Our goal is to build a distributed traffic management scheme 
in which the source node of each flow i can split its traffic, 
Xi, among multiple different paths, so as to reduce the total 
number of transmissions, per unit rate, required to support a 
given traffic. Note that on some of these paths there might be 
a possibility of network coding. 

For example, consider the network depicted on Figure |2] 
The network supports three flows: (i) flow 1 from ni to 77,4, (ii) 
flow 2 from 714 to ne, and (iii) flow 3 from to rii . We denote 
by Xi the traffic associated with flow i, 1 < i < 3. Suppose that 
the packets that belong to flow 1 can be sent over two paths 
(711,712,713,714) and (711,712,715,714). We denote these paths 
by Pi and Pf . The traffic split on paths P/ and Pf is given 
by x\, x\, respectively, such that x\ -\- xf = xi. Similarly, 
flow 2 can be sent over two paths Pj^ = (714, 713, 712, ^e) and 
Pi = (714, 7i8, 7i6) at rates xl and x^, such that X2 +^2 = X2. 
Finally, flow 3 can be sent over two paths P3^ = (715, 717,711) 
and P3 = (715,712,711), at rates xl and x^, with sum X3. 

Note that path Pf = (711,712,715,714) of flow 1 and path 
P3 = (715,712,^1) of flow 3 share two links (711,712) and 
(712,715) in the opposite directions. Thus, the packets sent 
along these two paths can benefit from reverse carpooling. 
Specifically, node 712 can combine packets of flow 1 received 
from node 711 and packets of flow 3 received from node 715. 
Similarly, node 713 can combine packets of flow 1 received 
from node 712 and packets of flow 2 received from node 
714. Note that the cost saving at node 712 is proportional to 
mm{xl,xl}, while the saving at node 713 is proportional to 
min{x^, X2}. Note that our model is not restricted to reverse 
carpooling type XOR coding alone. Other types of XOR 
coding schemes like COPE [12], which uses "opportunistic 
listening" can also be used. 

The cost at node 712 when coding is enabled is 

Cn^ixj.xl) = max{a2i,Q^25}min{x?,x|} (1) 
+0^25(^1 - min{xi, X3}) 
+0^21(^3 - mm{xl,xl}). 

Here, the first term on the right is the cost incurred due to 
coding at node 712. This is because a coded packet from 712 
is broadcast to both destination nodes, 711 and 715, and so the 
cost per unit rate is max{a2i, 0^25}- The second and third term 
are "overflow" terms. Since its is possible that xf 7^ X3, the 
remaining flow of the larger (that cannot be encoded because 
of the lack of flow in the opposite direction) is sent without 
coding at the regular link cost. 

The cost at node 712, given by ([T]), can be re- written as 
shown below: 

Cn2{x\,xl) = ^25^1 +<^2i^2 + | max{a2i , Q^25 } 
-(^21 + a25)| mm{xl,xl}. 

Using the fact that max{xi,X2} + min{xi,X2} = xi -\- X2, 



we obtain 

Cn2{xl,xl) = OL2-yX\^ OL2\x\ (2) 

- min{a2i, 0^25} min{xi, x\\. 

The above equation can be interpreted as the cost at node 
7i2 without coding minus the savings obtained when coding is 
used. Thus, the cost saved at node 712 due to network coding is 
min{a2i, 0^25} min{x^, X3} . Similarly, for node 713 the cost 
saved is min{a32, 0^34} min{x^, X2}. The total system cost 
can be expressed as: 
3 2 

G{X) = ^^^^^lx\ - min{a2i,Q^25}min{x^,x|} (3) 

2=1 i = l 

- min{a32, 0^34} min{xi, ^2}, 

where X = {x\^x\^x\^x\^x\^x^ is the state of the system 
and f5l is the uncoded path cost (equal to the sum of the 
link costs on the path) j used by flow i. For example, 0\ = 
<^12 + <^23 + <^34, for path P\ = (711,712,713,714). Thus, the 
first term on the right in ([3]) is the total cost of the system 
without any coding, while the second and third terms are the 
savings obtained by coding at nodes 712 and 713. 

In this paper, we consider the problem of minimizing 
total cost, given the traffic matrix. The problem poses major 
challenges due to the need to achieve a certain degree of 
coordination among the flows. For example, for the network 
depicted in Figure |2j increasing of the value of x\ (the decision 
made by node 715) will result in a system- wide cost reduction 
only if it is accompanied by the increase in the value of x\. 

III. Hyper-links and System Cost 

In order to decouple the decisions of flows, we introduce 
the idea of a hyper-link whose capacity can be controlled 
independently of the flows that use it. 

Definition 1: A hyper-link is a broadcast-link com- 
posed of three nodes and two flows. A hyper-link 
7i/e[(z,p, 71^), (j, g', 7ij))] at node Uk can encode packets be- 
longing to flow i (sending packets on path p) with flow j 
(sending packets on path q). Here, nodes 71^ and 71 j are the 
next-hop neighbors of nk\ for flow i along path p and for flow 
j along path respectively. 

For each hyper-link 7i/e[(i,p, 71^), (j, 7ij))], we introduce a 
new decision variable that denotes the capacity of the 
hyper-link. This formulation helps us to decouple the coor- 
dination between individual flows. We restrict the total coded 
(broadcast) traffic between the two flows at node Uk to be 
at-most equal to the hyper-link capacity yk- Any remaining 
flow is sent without coding. Referring to Figure [2j there 
exists a hyper-link hi = 7i2[(l, Pf , 715), (3, P3 , 711)], where 
the source node 712 can encode packets of flow (flow 
along path Pf), destined to node 713, with packets of flow 
/I (flow along path P3), destined to node 71 1. Similarly, 
there exists a hyper-link /12 = ^3 [ ( 1 , , ^4 ) , ( 2 , P2^ , 712 ) ] , 
where the source node 713 can encode packets of flow fl, 
destined to node 714, with packets of flow /2 destined to 
node 7i2. Let the hyper-link capacities be defined as y2 and 



ys respectively. The total cost of transmission on hyper-link 

hi = n2[(l,Pi ,715), (3,P3^,ni)] of capacity y2 is given by 

C{hi) = max{a25 , 0^21 }^2 + (4) 
^25(^1 - mm{xl,y2} + 
a2i{xl - min{x|,?/2}, 

where the first term on the right is the cost of sending traffic on 
the hyper-link. Note that we have to bear this cost, regardless 
of whether or not there is enough bidirectional flow to be sent 
on the hyper-link. This relaxation could potentially increase 
the total cost of the system. However, as we will see in 
Section |Vlj we can design a hyper-link capacity controller 
which would adjust the hyper-link capacities periodically to 
minimize cost. As before, the "overflow" packets are sent 
without coding, and the cost incurred in doing so is given 
by the latter two terms. 

The cost at node n2, given by (|4]), can be re- written as: 

C{hi) = a25xl + ^21^3 — T{hi), where 
T{hi) = a25 mm{xl, 7/2} + 0^21 min{x|, y2} 
- max{a25,Q^2i}^2 

Recall that the first two cost terms are the total cost at node 
n2 when coding is disabled. The remaining cost, T(/ii), can 
be thought of as the rebate obtained by using hyper-link 
hi = n2[(l,Pi ,725), (3,P3 ,ni)]. Note that the rebate could 
be negative (hence adding to the total cost), which might 
happen when one of the flow's rate is and the other flow's 
rate is less than the hyper-link capacity. 

Thus, the modified cost function when the system is in state 
(X, F) is given by 

3 2 

= Y^f^l'^ - m^i) + ^(^2)), (5) 

i=i j=i 

where X = {xj, x?, x^, x^, 4, x^}, Y = {^2,^3}. 
T{hi) and T{h2) are the rebates obtained by using 
hyper-link hi = n2[(l, , ns), (3, Pg^, ni)] and /12 = 
n3[(l, Pi , n4), (2, P^,n2)]), respectively. 

In general, the total system cost in terms of number of 
transmissions required to support a given traffic load, when 
the state of the system is (X, F), is: 

C (X, Y) = Total system cost without coding 

— Total rebate of all the hyper-links (6) 

We focus on minimizing this total cost. To this end, we relax 
the problem into two sub-problems-that of traffic splitting by 
sources, and that of hyper-link capacity selection: 

1) Traffic Splitting: In this phase, the source node of each 
flow splits its traffic among the different options, for a 
given hyper-link state Y. The options available to each 
flow are called hyper-paths, where each such hyper-path 
contains zero or more hyper-links. We model this phase 
as a potential game; the background needed is presented 
in Section [IV] Details of our game model and the payoffs 
used are covered in Section |Vl 



2) Hyper-Link Capacity Control: In this phase, we ad- 
just the hyper-link capacities in order to minimize the 
total cost. We use a simple gradient descent controller 
to attain minimum cost. In this phase it is assumed 
the sources attain Wardrop equilibrium instantaneously. 
Further details on the type of controller used and the 
convergence properties are covered in Section |Vl] 

We call our controller as Decoupled Dynamics. The two phases 
operate at different time scales. Traffic splitting is done at 
every small time scale and the hyper-link capacity control is 
done at every large time scale. Thus, sources attain equilibrium 
for a given hyper-link capacities, then the hyper-link capacities 
are adjusted, and this in turn forces the sources to change their 
splits. This process continues until the source splits and hyper- 
link capacities converge. 

IV. Background: Potential Games 

Below we review some game-theoretic ideas that will be 
used in this paper. Detailed discussion may be found in [13]. 
A population game Q, with F non-atomic populations of 
players is defined by a mass and a strategy set for each 
population and a payoff function for each strategy. By a non- 
atomic population, we mean that the contribution of each 
member of the population is infinitesimal. We denote the set of 
populations by jr = where F > 1. The population 

i has mass Xi . The set of strategies for population i is denoted 
Si = These strategies can be thought of as the 

actions that members of i could possibly take. A particular 
strategy distribution is the way the population i partitions itself 
into the different actions available, i.e., a strategy distribution 
for i is vector of the form Xi = {x}^x'^^ ...xf'}, where 
Xlpli = Xi- The set of strategy distributions of a popula- 
tion z G JT, is denoted by Xi = {xi G R^' : Xlpli = Xi}. 
We denote the vector of strategy distributions being used by 
the entire population byX = {xi,X2,...,Xi?}, where Xi G X^. 
The vector X can be thought of as the state of the system. 
Let the space of all strategy distributions be X. 

The marginal payoff function (per unit mass) obtained from 
strategy p e Si by users of class i, when the state of the 
system is X is denoted by i^f (X) G M and is assumed to 
be continuous and differentiable. Note that the payoffs to a 
strategy in population i can depend on the strategy distribution 
within population i itself. The total payoff to users of class i 
is then given by Xlpii ^iO^)x^, where we assume linearity 
for exposition. 

Potential games are a type of population games, that have 
a specific structure on the cost function. The idea behind 
potential games is to identify a scalar function that represents 
the "energy" of the system (exactly like a Lyapunov function 
[14]), which is called the potential function. All information 
regarding the payoffs obtained by users of a population class 
can be captured in the potential function. 

Definition 2: Let ^ be a population game with payoff 
function (per unit mass) F : X ^ IZ^ . Q is called a Potential 
Game if there exists a continuously differentiable function 



T : X such that 

g(X)=Ff(X) (7) 

\/i G !F and p e Si, where X G A' is the state of the system. 
The function T is called the potential function for game Q. 

Next, we define the concept of equilibrium in population 
games. A commonly used concept in non-cooperative games in 
the context of infinitesimal players, is the Wardrop equilibrium 
[15]. Consider any strategy distribution 
There would be some elements which are non-zero and others 
which are zero. We call the strategies corresponding to the 
non-zero elements as the strategies used by population i. 

Definition 3: A state X is a Wardrop equilibrium if for any 
population i e all strategies being used by the members 
of i yield the same marginal payoff to each member of i, 
whereas the marginal payoff that would be obtained is lower 
for all strategies not used by population i. 

Let Si C Si be the set of all strategies used by population i 
in a strategy distribution X. A Wardrop equilibrium X is then 
characterized by the following relation: 

i^"(X) > Ff{X) Vs e Si and e Si 

The above concept refers to an equilibrium condition; the 
question arises as to how the system actually arrives at such 
a state. A commonly used kind of population dynamics is 
Brown-von Neumann-Nash (BNN) Dynamics [16]. The dynam- 
ics are described as follows: 

where, 7^ = max I F[ ^ F^xj^O > 

I ^'.=1 J 

Note that the total mass of the population i is a constant Xi. 
An interesting property of BNN dynamics is non-complacency, 
i.e., it allows extinct strategies to resurface, so that its station- 
ary points are always Wardrop equilibria [13]. 

V. Traffic Splitting: 
Multi-path Network Coding (MPNC) Game 

We model the traffic- splitting process of our Decoupled 
Dynamics controller as a potential game, Q, which we refer to 
as the Multi-Path Network Coding Game (MPNC Game). Our 
system model consists of a set of nodes JV = {ni,...,nAr}, 
where each node G A/" is surrounded by a random number 
of other nodes. The cost of transmission (per unit rate) from 
node rii to its neighboring node rij is a constant and is equal 
to aij, similarly, cost of transmission (per unit rate) from nj 
to rii is aji. There exists a set of flows (these correspond to 
players in the game) = {1, . . . , F}. Each flow, i G ^ is 
defined as a tuple {nf^nf^Xi), where n| G A/" is the source 
node, nf e JV is the destination node, and Xi packets/sec is the 
traffic sent from source to destination. This traffic is equivalent 
to the population mass in the population game interpretation. 
Each flow i is associated with a set of hyper-paths Si . 



Definition 4: A hyper-path p ^ Si between source n| and 
destination nf is a virtual path over a physical path between 
n| and nf. A hyper-path contains zero or more hyper- links on 
it and at each node on the underlying physical path there can 
be at-most one hyper-link. It follows that the set of all paths 
are a subset of the hyper-paths. 

In other words, a hyper-path can have a combination of at-most 
two flows at each node. A flow can split its traffic among the 
hyper-paths available to it, and we denote a sub-fiow ff of 
flow i by the tuple (nf^nf^p^x^). Here, x^ is the traffic sent 
by flow i on hyper-path p. The sum of link costs (per unit 
rate) on the physical path corresponding to the hyper-path is 
denoted Pf. Note that the cost seen by a sub-flow using such a 
hyper-path might be lower than this cost due to saving attained 
by network coding. 

We represent the division of traffic Xi of flow i e J^, over 
all the hyper-paths p G 5^ as a vector, Xi = {xj , . . . , xf' } 
such that Xlnii 

X — Xi, Xi IS called the strategy distribution 
of flow i, and the set of all the strategy distributions of all 
the flows is called the state of the flows and is represented as 
X = [xi . . . xf]- We denote the set of all states of the system 
as X, i.e., X G X. 

The set of all hyper-links in the network is assumed to be 
pre-determined and is represented by7Y = {l,...,i^}, where 
H is the number of hyper-links. Recall that the hyper-link 
formed by encoding packets that belong to flows i and j, for 

j G at node Uk is represented by nk[{ijPjni)^ {j^q^rij)]. 
Nodes rii and rij are the next hop nodes for the hyper-path 
of flows i and j, using hyper-paths p and q respectively. Note 
that we have slightly modified the definition to include the fact 
that i and j are using hyper-paths. We denote by TYf C 7i the 
set of all hyper-links associated with flow ff. 

Each hyper-link can choose its capacity independently 
of others. We denote the capacity of the hyper-link h = 
n/e[(z,p, n^), (j, g', nj)] by yh packets/sec. The hyper-link 
broadcasts packets received at node Uk to rii and rij up to 
capacity i/h- The vector of all hyper-link capacities is called 
the hyper-link state and is denoted by, F = , . . . , yn] • Let 
Y be the set of all possible hyper-link states, i.e., F G Y. The 
state of the system is defined as (X, Y), where X G X is the 
state of the flows and F G Y is the state of the hyper-links. 

In the traffic splitting phase of our algorithm, flows try to 
attain the state of lowest cost for a given hyper-link state 
Y. The hyper-link capacities are controlled in the next phase 
(hyper-link capacity control), discussed in Section |Vl| 

The payoff (per unit rate) obtained in using hyper-path 
p e Si of flow i e when the state of the system 
is (X,F) is denoted by Ff(X,F) G M and is assumed 
to be continuous and differentiable. We may have to make 
suitable approximations on cost functions to ensure that these 
conditions hold. We model our system as a potential game, 
using the total cost function C(X, Y) as our potential function. 
Recall from ^ that the total cost of the system is 

C{X,Y)= EliE'uPf^^-Etinh), (9) 



where 

T{h) = aki min{xf , yn} + akj min{xj, yh} 

-max{aki,akj}yh (10) 

As can be seen from ([T0|, the cost function contains "min" 
terms over the hyper-Hnk capacity and the flow rates, this 
makes the function non-continuous and non-differentiable. In 
order to have a continuously differentiable cost function we 
approximate these "min" terms using a generaHzed mean- 
valued function. 

Let a = {ai, . . . , a^} be the set of positive real numbers 
and let r be some non-zero real number. Then the generalized 
r-mean of a is given by: 



particular option,x^, so as to attain Wardrop equilibrium [15]. 
The source nodes use BNN dynamics ^ to control the mass 
on each option. But since each source tries to minimize its 
payoff, we use a modified version BNN dynamics: 



(20) 



(11) 



The "min" function over the set a is approximated using 

Mr{a) as: 



minjai, . . . , a„} = lim Mr{a) 



(12) 



Substituting for ([TT] instead of the "min" function in ([9] 
we get the approximated total cost function as: 

CiX, Y) = Ef=i Ep=i m - Eti (13) 
where for a hyper-link h = n/e[(i,p, n^), (j, n^)] G 7Y: 

-max{a/ei,a/cj}y/i (14) 

The cost function (7(X, Y) is continuous and differentiable. 
So, we use the approximated cost function as our potential 
function. Thus, it follows from the definition of potential 
games ( |2]) that, the payoff obtained by flow i e in using 
option p e Si is: 



f 1 1 

where, 7^ = max < — ^ ^/^/ ~ ^/ ' ^ ( 



In the next section, we prove the stability of our system using 
Lyapunov theory. 



A. Convergence of MPNC Game 

We show in this section that the multi-path network coding 
game converges to a stationary point when each source uses 
BNN dynamics. We will use the theory of Lyapunov func- 
tions [14] to show that our population game G, is stable for a 
given hyper-link state Y. We use the approximated total cost 



of the system (13) as our Lyapunov function. 

Theorem 1: The system of flows that use BNN dynamics 
with payoffs given by ([T9j is globally asymptotically stable for 
a given hyper-link state Y . 

Proof: We use the approximated total cost function 



C(X, F) ( 13 ) as our Lyapunov function. It is simple to verify 



that the cost function C(X, F), is non-negative is convex, and 
hence is a valid candidate. For a given hyper-link state, Y, we 
define our Lyapunov function as: 



Cy{X)=C{X,Y) 



(21) 



F!{X,Y) 

where, from ( fT4| 

df{h) _ aki 



dC{X,Y) 



VieJ^, peSi (15) From ^ 

hen'' (^6) 



dCyiX) dC{X,Y) 



r-l 



Recall that 



MrixlVh) 



(17) Hence, 



(18) 



dx^, 



F^AX,Y). 



(22) 



F ^Sf dCy(X) ^p 

p 

F ^Sf 



f=l Z^p=l 

EU^%iF^iX,Y)x} 



(23) 
(24) 



Hence, 



Ff(x,y) =/?f-E,eHr^(M;(fc))'"'' (19) 

where is the set of all hyper-links associated with sub-flow 
ff. Note, the payoff is the cost incurred in using an option, 
so the players try to minimize their cost. The source node of 
each flow, i e J^, observes the marginal cost, F[, obtained in 
using a particular option, p e Si, and changes the mass on that 



From (|20l) we can substitute the value for f ^ and we have 



4w = EE^;(-/^r-?E7}) 

/=ip=i i=i 

f=i \p=i \ J p=i J j^i ) 



We define 



where is the global minimum for the primal problem ([30|. 



J p — 1 

=^ E-/(E^/^/-E^/^^I (26) 
<-E^/(E(7PM <0 (28) 



Thus, 



(29) 



where equality exists when the state X corresponds to the 
stationary point of BNN dynamics. Hence, the system is 
globally asymptotically stable. ■ 

B. Efficiency 

The objective of our system is to minimize the total cost 
for a given load vector x = [xi, . . . ,xq] and given hyper- 
link state Y. Here the total cost in the system is C{X^Y) 
and is defined in ([9]). This can be represented as the following 
constrained minimization problem: 

min(7(X,f) (30) 
subject to: 

Si 

= Xi yieJ" (31) 

p=l 

< > 0. 

The Lagrange dual associated with the above minimization 
problem, for a given Y is 



/:^(A,/i,X) = maxmin C(X,r) - (32) 

F Si F Si 



=1 p=i 



i=l p=l 



where and /i^ > , V i G and p e Si, are the dual 
variables. Now the above dual problem gives the following 
Karush-Kuhn-Tucker first order conditions: 



dC 



dx: 



f(A,/i,X*) = y ieJ^ SindpeSi (33) 
and 

/i^xf = y ieJ^ SindpeSi (34) 



Hence from pS\ we have, y i e and y p e Si, 

-xH) 



^^(X^Y) A^^^-^ 

^{x ,r)-A,^ 



dC 

>Ff(x^f) 







p{X\Y) 



A. + /if 



(35) 
(36) 



where the last equation follows from the definition of potential 
games J2l. 

From ( [34| ), it follows that 

Ff(X^f) = Xi whenxf >0 (37) 
and 

Ff(X^f) = A, + /i[ whenxf = (38) 



y i ^ and y p e Si. The above condition ([37 38), implies 
that the payoff on all the options used is identical and for 
options not in use the payoff is more, which is equivalent to 
the definition of Wardrop equilibrium ([3]). Notice, we use a 
modified definition of Wardrop equilibrium, since each source 
tries to minimize it's cost (or payoff). 

We have the following theorem that proves the efficiency 
of our system. 

Theorem 2: The solution of the minimization problem in 



(30) is identical to the Wardrop equilibrium of the non- 
cooperative potential game Q. 

Proof: Consider the BNN dynamics ( [2Q| ), at stationary 
point, X, we have = 0, which implies that either. 



or 

where. 



= Ff(X,y) (39) 
F^ = lT!^^^x\FaX,Y) ViG^, (40) 



The above expressions imply that, all the hyper-paths used by 
a particular flow, i ^ T, will yield same payoff, Fi, while 
hyper-paths not used {x\ = 0) would yield a payoff higher 
than Fi. 

We observe that the conditions required for Wardrop equi- 
librium are identical to the KKT first order conditions ( [37] )- 
(38) of the minimization problem ([3Qj) when 



Fi = \i y i^T 

It follows from the convexity of the total system cost that, there 
is no duality-gap between the primal ( [30| and the dual ( [32j ) 
problems. Thus, the optimal primal solution is equal to optimal 
dual solutions, which is identical to the Wardrop equilibrium. 



VI. Hyper-Link Capacity Control 

Thus far we have designed a distributed scheme that would 
result in minimum cost for a given hyper-link state or capac- 
ities Y and for a given load vector x = {xi,...,^/}. In this 
phase of Decoupled Dynamics, the hyper-link capacities are 
adjusted based on the current system cost so as to guarantee 



a minimum total system cost for a given load vector x. This 
phase runs at a larger time-scale as compared to the traffic 
splitting phase described in Section |V| It is assumed that 
during this phase all the flows or players remain in equilibrium, 
i.e., changing the hyper-link capacities would force all the 
source nodes to attain Wardrop equilibrium instantaneously. 

The hyper-link capacity control can be formulated as a 
centralized convex optimization problem as follows: 



mm 

Y 



subject to. 



H{Y) 

yh>OyyheY and yheH 



(41) 



where, H{Y) is the minimum total cost of the system for a 
given hyper-link state Y, i.e., H(Y) = C{X'^^Y), where, for 
a given Y, is an optimal state of the flows that results in 
minimum costj^We use a simple gradient controller defined 
below: 



Vh -1^- 



dH{Y) 
dyh 



(42) 



The partial derivative, is over the variables yh ^ Y. 

Changing the hyper-link capacity yh, of some hyper-link 
h e H, would result in a different state of the flows, 
and hence a different minimum cost, C(X^,y^), where Y^ 
corresponds to the changed hyper-link capacity of yh while 
other capacities are fixed, as compared to Y. Thus for a hyper- 
link, h = n/e[(/f ,ni), {n.Uj)] with capacity yn. 



dH{Y) 
dyh 



dC 

1 dx, 



^(X*,F)&3) 



dC 



P=l dyh 



where, the last expression follows from the definition of 
( p3] ) and the fact that, for changes in the hyper-link state, the 
sources attain Wardrop equilibrium instantaneously. In other 
words, before and after a small change in yh the system is in 
Wardrop equilibrium. Hence, Ff = Fi \/i e and e Si. 
Finally, Y.pl 
fixed. 



=1 ~ ^ since, the total load Xi = ^pLi is 
dH{Y) 



dyh 



= M\Y) = 



(44) 



where, from (14), for hyper-link = n/c[(/f , n^), (/^, ^ 



^ (h\ — ^ki / Vh ] I Q^fcj / Vh ] 

-max{a/ei,a/cj} 

Recall, MAxly,)=(^<^^] ' 

Theorem 3: At the large time-scale, the hyper-link capacity 
control with dynamics ( [44| ) is globally asymptotically stable. 
Proof: We use the following Lyapunov function 

Z{Y) = V{Y)-V{Y) (45) 

where V{Y) = k.H{Y) (46) 

^Notice, there could be many different states, , which result in a 
minimum cost but the minimum value, C{X'*' , Y), is unique. 



which is strictly convex, with Y is the hyper-link state which 
results in minimum cost H{Y). Differentiating Z{Y) we 
obtain h 

z = E ^y^- (47) 



h=l 



Then from d46l and $44] 



dyh 



dyh dyh 

F -2 

z = y ^ < V r, 

Vh ~ 

h=i 



(48) 
(49) 



with Z = at the stationary points of the system. Thus, the 
system is globally asymptotically stable [14]. ■ 
Finally, it's not hard to show that the equilibrium conditions 
of the controller ( |44| ) are the same as the KKT conditions of 
the optimization problem pT) . Hence, the controller succeeds 
in minimizing the total cost of the system for a given load x 
into the system. Thus, the system state converges to a local 
minimum. Now, since the global cost minimization problem 
under the min approximation is convex, the solution is the 
global minimum of the relaxed problem. 

VII. Simulations 

We simulated our system in Matlab to show system con- 
vergence. We first performed our simulations for our simple 
network shown in Figure |2] The load at the source nodes 1, 
2 and 3 is given as 4.73, 2.69 and 3.56 respectively. We use 
the following costs on the individual links (aij): ai2 = 2.8, 

^23 = 1.6, ^34 = 1.8, ^25 = 1.3, Q^54 = 2.1, ^26 = 1-7, 

^48 = 2.9, ^86 = 2.2, = 1.9, an = 2.6; we assume the 
costs on the links are synmietric. We use the approximated cost 
function ( p3]), w ith a value of r = —100 for the approximation 
parameter (|12|) for our simulations. The simulation is run for 
50 large time scale units, and in each large time scale we have 
20 small time units. 

We compare the total cost of the system for the following 
scenarios: 

1) Decoupled Dynamics: This is the algorithm that we 
developed; we use our hyper-links to decouple the flows 
that participate in coding. 

2) Coupled Dynamics (no hyper-link): Here, there is cou- 
pling between individual flows and coding happens at 
the minimum rate of the constituent flows. We use 
similar game dynamics as that was used in DD. The 
total cost is specified in Equation ([3]). 

3) No Coding: In this system no network coding is used. 

4) LP Optimal: This is a centralized solution. We formu- 
lated our system as a Linear Program (LP) of minimizing 
cost ^ over X and Y for a given load vector that we 
obtain using an LP- solver. 

As seen in the Figure [3] the total cost of the system (number 
of transmissions per unit rate) for our model (decouples using 
hyper-link) is close to the optimal solution obtained by solving 
it in a centralized fashion. 




10 20 30 40 50 

Time (Large Time Scale) 



Fig. 3. Comparison of total system cost (per unit rate), for different systems: 
DD, CD and non-coded against LP. 



Variable 


x{ 












2/2 


2/3 


LP 


2.69 


2.04 


2.69 


0.00 


0.00 


3.56 


3.56 


2.69 


DD 


2.37 


3.35 


2.67 


0.01 


0.02 


3.54 


3.24 


2.49 


CD 


4.70 


0.02 


0.07 


2.61 


0.03 


3.52 


NA 


NA 



TABLE I 

Comparison of state variables for LP and DD and CD. 



For this simple network we compared the final system state 
of DD and CD with that of the solution obtained using LP. We 
observe from Table [ll| that the values for the split (X) and the 
hyper-link capacities (Y) generated by DD are near-optimal, 
but CD is very different. 

Next, we perform our simulations on a bigger topology 
shown in Figure |4] This network consists of 30 nodes shared 
by 6 flows. Flows 1, 2, 3 and 6 have two hyper-paths and 
flows 4 and 5 have three hyper-paths. There are 6 hyper-links 
in the system. Our results are shown in Table |Il| We observe 
that DD performs near-optimally and significantly ouperforms 
CD in terms of total cost. 

VIII. Conclusion 

We consider a wireless network with given costs on arcs, 
traffic matrix and multiple paths. The objective is to find 
the splits of traffic for each source across its multiple paths 
in a distributed manner leveraging the reverse carpooling 
technique. For this we relax to problem into two sub-problems, 
and propose a two-level distributed control scheme set up as 
a game between the sources and the hyperlink nodes. On one 
level, given a set of hyperlink capacities, the sources selfishly 
choose their splits and attain a Wardrop equilibrium. On the 
other level, given the traffic splits, the hyperlinks may slightly 
increase or decrease their capacities using a steepest descent 
algorithm. We construct a Lyapunov function argument to 
show that this process asymptotically converges, although 
performed selfishly in a distributed fashion. 



case 


LP 


DD 


CD 


1 


1293.3437 


1298.3194 


1325.0618 


2 


1550.4593 


1563.7340 


1625.06315 


3 


1624.6393 


1638.4021 


1642.8801 


4 


1826.8595 


1837.1489 


1865.6998 



TABLE II 

Total system cost comparison of Decoupled dynamics(DD) and 
Coupled dynamics (CD) against the LP solution. 




Fig. 4. Network topology. 



We performed several numerical studies and found that our 
two-level controller converges fast to the optimal solutions. 
Some of the bi-products of our experiments were that: more 
expensive paths before network coding became cheaper and 
shortest paths were not necessarily optimal. In conclusion, 
from a methodological standpoint we have a distributed con- 
troller that achieves a near-optimal solution when the individ- 
uals are self-interested. 
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